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ABSTRACT 



Traditionally, the errors-in-variables problem is concerned 
with the point estimation of the slope of the true scores 
regression line when the regressor is measured with error, and 
V7hen no specification error is present. In this paper the 
errors-in-variables problem is extended to also include 
specification error. As is well known, least squares procedures 
provide a biased estimator of the slope of the true scores 
regression line. Further, it is well known the maximum 
likelihood estimates of the slope (which are consistent) exist 
only once some assumptions are made. In this paper maximum 
likelihood estimates are given for the extended version of the 
errors-in-variables problem (i.e., when specification error is 
present) under the usual assumptions and under several new 
assumptions which are more appropriate for the social and 
behavioral sciences than the previously used assumptions. A 
simulation study is then described. The results of the 
simulation study show that the maximum likelihood estimates (both 
under the old and new assumptions) far outperform the least 
squares procedures when several different criteria (such as bias 
and standard error) are used. 
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REGRESSION SLOPE ESTIMATION WHEN BOTH 
MEASUREMENT AND SPECIFICATION ERROR ARE PRESENT 

One of the problems which has been of great interest over 

the years to econometricians and others is known as the errors- 

in-variables problem. Some of the more complete discussions of 

early work on this problem are M. Brown (1982), Johnston (1972), 

Kendell & Stuart (1973), Madansky (1959) and Moran (1971). 

Recently Fuller (1987) has published a volume devoted solely to 

the errors-in-variable problem that includes the newer, as well 

as the older, work on this problem. The main purposes of this 

paper are to present maximum likelihood solutions to the errors- 

in-variables problem for some situations that have not been 

Investigated by others and to report on a simulation study in 

which these new methods were studeled. For completeness, the 

maximum likelihood solutions presented in the past literature 

will also be given. 

Definition of the Problem 

For ease of discussion, some notation will be introduced 
presently. Let and represent two observed variables of 

it . * 

interest for individual i . Let and represent the values 

of X^ and Y^ if they were measured without error (i.e., the 

true scores or latent scores) ♦ Let arid e« represent the 

^i ^i 

errors of measurement (i.e., the difference between the true 
and observed values) in X and Y, respectively • Let a and 3 
represent the Y -intercept and slope, respectively, of the 

Y on X regression line. Basically the errors-in-variables problem 

is concerned with the point estimation of the slope (and Y -intercept) 
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of the Y on X regression line. Most often discussions of 

the errors-in-variables problem assume that there exists a 

correlation of +1 or -1 between X* and Y* (M. Brown, 1982; 

DeGracie & Fuller, 1972; Fuller & Hidiroglou, 1978; Johnson, 

1972; Kendall & Stuart, 1973; Madansky, 1959; Moran, 1971; 

Sprent, 1966; Wald, 1940; Lindley, 1953 and others). That is.- 

most discussions assume that there is no specification error 

present in the statement of the relationship between X* and 
* 

Y . The assumption of no specification error is, however, 
very unrealistic for most, if not all educational, psycho- 
logical, and other behavioral science applications. Further, 
the assumption of no specification error is necessry. Hence, 
the definition of the errors-in-variables problem will be 
extended in this paper to include specification error, a 
fairly thourough search of the literature revealed few sources 
where specification errors were considered. The only sources 
found were Cochran (1968) and Rock, Werts, Linn, and Joreskog 
(1977). Neither Cochran nor Rock, et. al., however, discuss 
explicit solutions to the errors-in-variables problem when 
specification error is added. Cochran only talks about the 
errors-in-variables problem in relationship to ANCOVA and 
Rock et. al. only talk in general terms about how, once enough 
identifying or overidentif ying restrictions are made, LISREL 
(Joreskog & Sorbom, 1981) can be used to obtain the values of 
the desired maximum likelihood estimates. 
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The errors-in-variablss problem can then be expressed 
symbolically as follows: 

Find a consistent point estimator of 3* when 

y * * * 

Y. = a + 6 .X. + e^ , (1) 

i 

= \ ' (2) 

and 

Yi = Y- + , (3) 

1 

where e^ is the specification error for individual i. 
i 

Further, it is assumed that the vector (X ,Y ,e„,e^,e ) has 

a multivariate normal distribution with mean vector ,Q 

and with a variance-covariance matrix all of whose off-diagonal 
elements (i.e., the covariances) except a** (the covariance 

X Y 

* * 

of X and Y ) are zero and whose diagonal elements (i.e., the 

variances.) are in order symbolized by a^.*, o}*, , , and . 

X y ejj ey eg 

Background 

There are many educational and other behavioral science 
situations in which the estimation of 3* is desirable. The 
author perused several recent years of American Educational 
Research Journal and of Journal of Educational Research . Many 
examples were encountered where the authors estimated 3* for a 
variety of variables, X and Y. All, however, used the least 
squares estimator of 3*. 

The usual least squares estimate of 3, the slope of the 
y on X regression line when X is measured without error, is 
given by 3^.^ = _fxY_ , where s^^^ and are the sample covariance of 

ERLC g 
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X and Y and the variance of Y, respectively. It is well known 

^^^^ ^LS ^ consistent estimator of g , since E(e ) = 

LS 

(Berkson, 1950; Johnston, 1972; Lindley, 1947; and 

others). Hence, it has been suggested that maximum likelihood 
estimation or other techniques be used instead. In order to 
compute the maximum likelihood estimates of a*and it is 
necessary to get expressions for the population means, variances, 
and covariances of the observed variables in terms of e*-.;arid 
other parameters of interest. Hence 

= ^X ' (4) 
My = a + , from equation (1) ; (5) 

2 2x 2 ^ 
c^X ' equation (2) ; (6) 

X 

= (e*)2.a^* + al + a2 , (7) 

Y ®s 

from equations (1) and (3); 

and ^ 

^XY = ^ ' (8) 

where c^^ is the population covariance of X and Y. 
Equation (8) can be derived as follows: 

^XY Cov{X,Y) 

= Cov(X + e^, Y + ey) [from equations (2) and 

= Cov(X*,Y*) + Cov(X*,ey) + Cov(ejj,Y*) + Cov{e^,e^) . 
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Next, by assumption, Cov(X*,eY), Cov{e^,Y*) , and Cov(e^,e^) 
are all zero. Hence a^^ = Cov(X*,Y*) . Therefore, 

°XY Cov(X*,Y*) 

* * * * 
= Cov(x ,a + e .X + e^) from equation (1) 

= Cov{X*,a*) + Cov(X*,e*-X*) + Cov(X*,e ) 

s 

. * * 

= e •cov(x ,x ) , 

Since Cov(X ^e^) is zero by assumption and since Cov(X ) = 0/ 
because a is a constant. Consequently, a = 3 -o^* . 

Equations (6) and (7) are derived similarly. 

As is widely known (e.g. Kendall & Stuart, 1973; Mood, 
Graybill, & Boes, 1974), the maximum likelihood estimates of 
^X' ^Y' ^x' ^Y' ^3:y given by X, Y, S^, S^, and S^^ ' 

respectively, when it is assumed that the joint distribution 
of X and Y is multivariate normal. Theoretically, the maximum 
likelihood estimates of the unknown parameters (i.e., a*, B*, y , 

Oy* , , and ) on the right hand side of the system of 
A e^ 

equations (4) to (8) are then computed by solving the system 

for these unknown parameters in terms of the maximum likelihood 

estimates of y^, a^, o^, and o^^ (Mood, Graybill, & Boes, 

1974) . But, since there are only five equations in six un- 
knowns, an infinite number of solutions exist. The presence 
of an infinite number of solutions is not, however, of ^ 
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practical use. Hence, it is necessary to make additional 
assumptions so that an infinity of solutions do not exist. 

Previous Solutions 

Some assumptions that have been made in the past litera- 
ture are: 

Assumption A: is known and = o ; 



Assumption B: ol is known and = o : 

e_.. £i ^ r 



^ s 



Assumption C: The ratio is known and = o • 

and 

o| 

Assumption D: The ratio is known and = o . 

®X ®s 

Assumption C is equivalent to assuming that the reliability 

of X is known. The reliability of a measure, say X, is defined 

as the ratio of true score variance to observed score variance 

and is denoted by p^^^^. To see the equivalence of Assumption C 

and p^^ being known, let k = — ^ — . Then p = ^ = 

o^* + ~ k + 1 ' dividing the numerator and denomi- 

X 

nator by . The maximum likelihood estimates of 6* under 
X 
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Assumptions A, B, C, and D are: 






under Assumption A, B* = ——^ (Kendall & Stuart, 1973). 


(9) 




- ^e 

Under Assumption B, B* = ^ (Kendall & Stuart, 1973) . 

XY 






r, ^ . ^* (k + 1) .S„^ S^„ - 

HndPT- a^cnitipi-ir^n r 0 - XY _ XY 1 g 


(10) 




(Johnston, 1972) . 






Under Assumption D, B* = ~ ^'^x ^^Y ~ ^'^x^^ 4X-(Sy^)^ 






2-SxY 






(Kendall & Stuart, 1972). Under all r^f these assumptions and 






under the new assumptions to be discussed later, a = Y - B**X . 






For Assumption A, when an estimate of is available instead 

®X 






of being known, DeGracie and Fuller (1972) have derived a 
X 






consistent non-maximum likelihood estimator of B*. 






Interestingly, the assumption that = 0 is not needed 






in order to obtain the maximum likelihood estimates of B* when 






either is known (i.e.. Assumption A) or when is known 
X ■XX 






(i.e.. Assumption C) . The reason is that if one sets the value 
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of equal to some arbitrary constant the maximum likelihood 
s 

estimates of 6* can still be calculated in both these cases, 

since setting equal to some arbitrary value allows the 

s 

system of equations (4) to (8) to be solved for B*. Further, 

no matter what this arbitrary value of is, the maximum 

s 

. . * 

likelihood estimate of B is the same as in equation (9) 

[when is known] or as in equation (10) [when p^^ is known]. 



Since these estimates do not change as changes, the max- 

imum likelihood estimates in these two cases when is 

e 

s 

unknown are still given by equation (9) and equation (10). 

New Solutions 

Before giving the maximum likelihood solutions to the 
errors-in-variables problem under situations which have not 
been discussed in the past literature, a listing of the 
possible assumptions that can made concerning the parameters 

2 2 2 9 

' ' ' ^^^^ given. These possible 

X s 

assumptions are: 

Assumption 1: is known , 

Assumption 2: is known , 

Assumption 3: The ratio — ^ is known (i.e., p^^ is known). 
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Assumption 4: The ratio ^ is known , 

Assumption 5: is known , 



e 
s 



Assumption 6: + is known , 



e e 
s 



0? + o2 



e e 

Assun.ption 7: The ratio ^ ^ — is known , 

e 

Assumption 8: The ratio ^ is known. 

Assumption 9: The ratio ^ is known. 

Assumption 10: o^* is known. 

Assumption 12: p^^ is known (i.e., the ratio is known), 

and 

Assumption 11: p^^ = . ®Y 

-.t is realized that some of these assumptions are more 
plausible than others. The relative plausibility of the various 
assumptions will differ across data analysis situations. It 
is interesting to note that all of the assumptions except 
Assumption U actually require knowledge on the part o^ the 
d.7- .alyst. Assumption H, however, only requires believa- 
•ven though Assumptions 1 «> 10, & 12 are stated in 
~' is known, it is realized that in actual 
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data analysis situations the analysts will be providing their 

best guess at the correct value of a particular quantity rather 

than knowing the correct value. In the remainder of this 

paper • it will be assumed nevertheless that the correct values 

of particular quantities a^e known rather than guessed at. 

The solutions under Assumption 1 or under Assumption 3 

have already been discussed when it was shown that the =0 

e 

s 

part of Assumptions A and C is not necessary. Solutions can 
also be derived when either Assumption 6, 7, or 10 is used. 
When any of the remaining assumptions (i.e., Assumptions 2, 
4f 5^ 3/ 9, 11, or 12) is taken singly, an infinite number of 

solutions for 3 , in terms of aj, o^, o^^, and the particular 

assumption taken, still exist. Hence, maximum likelihood 
* 

estimates of 3 are not available' when any of the Assumptions 
2r 4, 5, 8, 9, 11, or 12 is taken singly. Solutions can be 
derived, however, when any combination of two of these assump- 
tions, except 4 and 11 or 2 and 12, is jointly assximed. 
When Assvimptions 4 and 11 or Assumptions 2 and 12 are jointly 
assumea an infinite number of solutions for 6 still exist, 
and hence maximum likelihood estimates still do not exist. 
Only elementary algebra is necessary to derive the maximum 
likelihood estimates, even though in some cases the derivations 
are quite tedious. Hence the author has chosen in this paper to 
give only the maximum likelihood estimates and eliminate the 
derivations of these estimates. 
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Table 1 gives these maximum likelihood estimates of R under 
the various assumptions* in this table when more than one set of 
assumptions are listed together in the first column (e.g., 
3 alone; 11 and 12 jointly), this means that each of these sets 
of assumptions generates the same solution for ft* . when two or 
more expressions for ^ are given together in the second column 
and are connected by a ; , this means that these expressions are 
two different "computational" formulae for the same estimator. 



Insert Table 1 about here 



Set"Up of Simulation Study 

A simulation study was performed that involved 1000 data 

sets at each of three levels of data reliability crossed with 

three levels of sample sizes. This simulation study is only a 

first step in the comparison of the various estimators proposed 

in the last section. As will be discussed later, more remains to 

be done. In order to be able to compare the properties of the 

various estimators, the generated data sets simultaneously met 

the Assumptions 1 through 12. For all data sets, without loss of 

2 

generality, d^^ was set to 1, and then a was set to 1/4. 

Since, as was stated earlier, no previous work is known to the 

author where specification error was included, the choice of a 

e 

as being equal to 1/4 of cr* was made because even in those 

X 

social and behavioral science situations where errors of 
measurement are negligible, and where a linear relationship makes 
theoretical sense, the r 's seem to often be around .75, 
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leaving approximately one-quarter of the variance being due to 
specification error. For this simulation study the values of a = 
0. Makincj a = 0 was done in order to be able to focus on the 

^ it 

estimation of 15 . Arbitrarily, R was set equal to 1.5 . This 



simulation study needs, however, be repeated in the future with 

* A 2 



different values of fl . Once fl is set equal to 1.5 and a^* = 1, 



it follows that a^* = 2.25 . 

The three levels of data reliability chosen for this study 
were .5, .7, and .9 . In order to be able to compare across the 
various assumptions, each reliability value represents the common 
value of and p^y • Once the reliability value is chosen the 

remainder of simulation population values can be explicitly 

* 2 

determined since they are functions of fl , c^* , p^^^ , and pyy • 

These values are: 

Value when Value when Value when 
Parameter p=.5 p=.7 p=.9 



2 

a 

2 

a 



3/7 1/9 



9/4 27/28 1/4 



The sample sizes chosen for this study were 25, 50, and 100, 
since these are typical sample sizes in the social and behavioral 
sciences . 

The first stage of the simulation study was to generate 
pseudo-random standard normal deviates. The second stage of the 
simulation involved estimating the Estimators 1 through 15 and 
the usual least squares estimator for 1000 data sets based on 
pairs of observations, X and Y, that were generated using the 
pseudo-random deviates generated at the first stage. For each 
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pair of observations, X and Y, (i.e., simulated individuals) four 
standard normal deviates were needed. The first deviate 
represented X . The second and third deviates were multiplied by 
the appropriate values to give c and e . The fourth deviate 

.A. Y 

was multiplied by .25 to give e . Each deviate was generated 

s, 

separately by entering four different seeds simultaneously into 
the RANNOR pseudo-random standard normal generator in SAS Version 
5.1 as implemented on the VAX computer at Winona State University 
running under the VMS operating system. Due to internal 
limitations on VAX at Winona State, only 6250 individuals' 
deviates could be generated per SAS run. Four new seeds were 
entered for each of the 16 runs used to generate the 400,000 
deviates needed in this study (1000 simulation runs x 100 
individuals x 4 deviates per individual). The RANNOR generator 
in SAS worked well. The means for the 64 sets of 6250 pseudo- 
standard normal deviates ranged from a low of -.03833002 to a 
high of +.01904790 v/hile the standard deviations ranged from a 
low of .981^0919 to a high of 1.01668405 . The pairwise 
correlations between the four deviates within a run ranged from a 
low of -.02693 to a high of +.03227 . 

Results 

For the reliability values of .5 and .7 all of the maximum 
likelihood estimators studied did better than the usual least 
squares estimator, while for the reliability value of .9 the 
maximum likelihood estimators did better in almost all cases. 
Tables 3 through 11 provide summaries of the simulation results 
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for the various reliability values and sample sizes • Table 2 
provides the key to the estimator numbers used in Tables 3 
through 11* in each table the last column represents the 



Insert Tables 2 through 11 about here 

percentage of times that the estimator being studied is closer to 
the correct theoretical value of 1,5 for fl* than the usual least 
squares estimator is. Although this is not a traditional method 
for comparing estimators, it was felt by the author that this 
column gives extremely valuable information in this situation. 
The column labeled "Mean" in each of Tables 3 to 11 provides the 
observed means for each estimator rather than the theoretical 
values, even though the theoretical values can easily be derived. 
It should be noted that the theoretical values were calculated by 
the author and the observed values for the mean were indeed close 
to the theoretical values. The reason, however, that a 
simulation study was done was to obtain estimates of the standard 
errors of the various estimators and to assess the percentage of 
times each estimator outperformed the usual least squares 
<5Stimator. since observed standard errors and percentages are 
reported in the tables, the observed means are reported simply 
for consistency purposes. 

The first thing that should be noticed from Tables 3 to 11 
is, as would be expected, that as the sample sizes increase 
within a reliability value all of the estimators, with only a few 
exceptions, have decreasing standard errors, less bias (i.e., the 
observed means become closer to the theoretical value of 1.5), 
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and hence increased percentages of times better than least 
squares estimator. Estimators 10, 12, and 13 for the reliability 
of .5; Estimator 5 for the reliability value of .7; and Estimator 
5 for the reliability value of .9 are the exceptions and they 
are exceptions only for the bias of the mean. It should be 
noticed, however, that these estimators have very little bias 
already for samples of size 25. Hence, this nondecrease in the 
values for the means could be caused simply by nested sets of 
random numbers being used for the simulations at the different 
sample sizes (i.e., the 100,000 random deviates used for n = 25 
are a subset of the 200,000 random deviates used for n = 50, 
which in turn are a subset of the 400,000 deviates used for n = 
100. ) 

The second thing to notice is that within a sample size 
(i.e., n = 25, n = 50, or n = 100) in all cases the percentage of 
times that an estimator does better than the usual least squares 
estimator decreases as the reliability of the data increases. 
This is not all that surprising when one remembers that the bias 
m the least squares estimator of ft is heavily dependent on the 
reliability of the data. That is, the bias of the least squares 
estimator is equal to (p - l)*ft* and hence the bias increases 
as the reliability of the data decreases. Further, as would be 
expected, within a sampla size almost always the bias and the 
standard errors of the various estimators decreas.e as the 
reliability increases. The exce|)tions are the bias for 
Estimators 10, 12, and 13 for the sample sizes of n = 25 and 50 
and Estimators 2, 3, 10, 12, and 13 when n = 100. 

18 
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Conclusions 

A final thing to notice across the estimators is that all of 
the estimators except Estimators 3, 5, and 9 have similar 
standard deviations and percentage of times better than the usual 
least squares estimator for a fixed sample size and reliability 
level (except in the case when n = 25 and the reliability value 
is .5). Hence, when several of the assumptions made in 
generating the various estimators seem reasonable simultaneously 
in a particular real-world setting, this simulation study does 
not give very much advice for choosing between the various 
estimators, what can be concluded, however, is that if a variety 
of assumptions fit the real-world setting, then the researcher 
should avoid using Estimators 3, 5, and 9. On the other hand, 
and more importantly, all of the estimators presented here, 
except Estimator 9, do much better than the usual least squares 
estimator. Hence, as long as one of the sets of assumptions 
(except those that lead to Estimator 9) presented in this paper 
holds, the corresponding maximum likelihood estimators should be 
used over the usual least squares estimator. As mentioned above, 
the only estimator that performed badly was Estimator 9. The 
author can not account for this. Perhaps there was an algebraic 
error or a programming error on the author's part, but after 
several checks by the author neither type of error was detected. 
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Limitations and Future Directions 



There are two major limitations to this study. First, only 

one value of R was studied and only one value of a was used 

e 

s 

when studying this value of the true scores regression line 

slope. Hence, further simulation studies need to be performed 

where the values of and are systematically varied. 

s 

Second, this study was performed using only data sets where all 
of the assumptions were met simultaneously. A study needs to be 
done where different assumptions are violated and the effects of 



these violations are studied for the various estimators. 
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Assumptions 
1 



Table 1 

Maximum Likelihood Estimates of g* 
Under Various Assumptions 



Maximum Likelihood Estimate of g* (B*) 



XY 



3 
or 

11 & 12 jointly 



XY 



2 

X 



e + 1 



XY 



X 



Where e = 



X 



6 

or 

2 & 5 jointly 
or 

2 & 8 jointly 
or 

5 & 8 jointly 



^ ®Y 



'S 



XY 



7 , 

or " ^*^X /^^Y ~ ^*^X^ ^*^^^XY^ 

4 & 8 jointly " 
or 

4 & 9 jointly °l + 

Qj. where n= 



8 & 9 jointly 



10 



XY 



X 



continued on next page 
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Table 1 (continued) 



Assumptions 
2 & 4 jointly 



2 & 9 jointly 



Maximum Likelihood Estimate of ft* (B*) 



XY 



XY 



■where x 



2S 



XY 



where 6 = — 

-2 



2 & 11 jointly 



4 & 5 jointly 



2S 



XY 



where A = 



4 « 12 jointly 



5 & 9 jointly 



XY 



sr. -^(iTT~^ 



.2 
'X 



where X = 



YY' Y 



XY 



XY 



where 6 
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Table 1 (continued) 



Assumptio ns 



5 & 11 jointly 



Maximum Likelihood Esf imate of B* ii*) 



2.S 



XY 



5 & 12 jointly 



PyY'Sy - 



XY 



8 & 11 jointly 



y / (Y.S^)H 4- (1 +Y).- ^ 



X 



2-S 



XY 



where T = 



e. 



9 & 11 jointly 



+/(6.s^)2+ 4 



2.S 



XY 



where 6 = ^ 

„2 



continued on next page 
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Table 1 (continued) 



Assumptions 



Maximum Likelihood Estimates of B* (B*) 



8 & 12 jointly 



^YY ^* ^Pyy ~ ^^^^ 



XY 



where y = 



9 & 12 jointly 



•s^ + Ad 



.2 

'Y 



2-S 



XY 



where 6 = 



2 & 12 jointly 
or 

4 fi 11 jointly 



No maximum likelihood estimateo exist 
(not enough information) 
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TABLE 2 

Correspondence Between Estimator Numbers and Assumptions 



Estimator # Assumption.^ 

1 1; 2 & 4 jointly; 5 & 9 jointly 

2 3; li & 12 jointly 

3 6; 2 & 5 jointly; 2 & 8 jointly; 5 & 8 jointly 

4 7; 4 & 8 jointly; 4 & 9 jointly; 8 & 9 jointly 

5 10 

6 2 & 9 jointly 

7 2 & 11 jointly 

8 4 & 5 jointly 

9 4 & 12 jointly 

10 5 & 11 jointly 

11 5 & 12 jointly 

12 8 & 11 jointly 

13 9 & 11 jointly 

14 8 £ 12 jointly 

15 9 & 12 jointly 
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TABLE 3 

Maximum Likelihood Estimates 
Using n = 25 and P = .5 



Estimator 
Number 

Least Squares 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 



Mean 
0.78739 
2.32623 
1.57478 
1.73922 
1.77828 
1.51421 
1.75161 
2.02469 
1.96543 

1.51160 
1.90996 
1.51126 
1.51479 
1.94126 
1.86877 



Standard Error 

0.31837 

6.05184 

0.63674 

3.66971 

2.35811 

0.72259 

3.40772 

3.95133 

1.84852 
* 

0.38382 
2.69919 
0.37071 
0.40089 
2.81998 
2.57002 



Percentage of Times 
Better Than Least Squares 



63.4% 

75.8% 

58.3% 

75.9% 

75.4% 

66.0% 

70.3% 

68.0% 
* 

88.5% 
69.8% 
88.7% 
87.9% 
69. 2^ 
76.1% 



Note: The author made a formatting error when printing out the values of 
Estimator 9 for the 1000 genere .:ed data sets. This error was caused by the 
estimator taking on values less than -9.999995 or greater than 99.999995 . 
Hence, summary statistics are not reported here for Estimator 9. 
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TABLE 4 



Maximum Likelihood Estimates 
Using n = 50 and p = .5 



Estimator 
Number - 

Least Squares 
1 

2 ' 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 



Mean 

0.77055 

1.81852 

1.54109 

1.54068 

1.59450 

1.51537 

1.55717 

1.64410 

1.79726 

2.97829 

1.51545 

1.62225 

1.51488 

1.51639 

1.63041 

1.62036 



Standard Error 
0.21214 
1.91392 
0.42427 
0.75099 
0.60344 
0.50059 
0.70521 
0.48278 
0.71365 
4.88477 
0.21600 
0.67088 
0.20835 
0.23032 
0.67880 
0.62184 



Percentage of Times 
Better Than Least Squares 



76.7% 

86.8% 

79.3% 

86.6% 

85.2% 

84.9% 

87.3% 

79.4% 

52.9% 

96.0% 

89.5% 

96.4% 

95.7% 

89.0% 

92.0% 
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TABLE 5 



Maximum Likelihood Estimates 
Using n = 100 and p = .5 



Estimator 
Number 

Least Squares 
1 

2 

3 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 



Mean 
0.75573 
1.61274 
1.51146 
1.51250 
1.54117 
1.49434 
1.52162 
1.56125 
1.73606 
2.68236 
1.50831 
1.55318 
1.50822 
1.50912 
1.55724 
1.55315 



Standard Error 
0.13974 
0.49832 
0.27947 
0.41903 
0.31076 
0.34833 
0.38473 
0.25456 
0.35537 
3.99031 
0.14386 
0.31720 
0.13731 
0.15131 
0.31735 
0.29047 



Percentage of Times 
Better Than Least Squares 



88.4% 

95.9% 

93.2% 

95.3% 

94.5% 

94.8% 

96.9% 

89.3% 

58,0% 

99.3% 

97.4% 

99.4% 

99.3% 

97.5% 

98.3% 
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TABLE 6 

Maximum Likelihood Estimates 
Using n 25 and p = .7 



Estimator 
Number 

Least Squares 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 



Mean 
1.09879 
1.74903 
1.56970 
1.51658 
1.59520 
1.50493 
1.54858 
1.62745 
1.67538 
2.30126 
1.53710 
1.60173 
1.54076 
1.54411 
1.62160 
1.61040 



Standard Error 
0.28160 
0.91796 
0.40228 
0.61446 
0.50276 
0.55060 
0.55868 
0.32913 
0.62468 
3.42575 
0.26942 
0.82520 
0.26216 
0.28501 
0.89816 
0.78194 



Percentage of Times 
Better- Than Least Squares 



64.9% 

69.9% 

62.6% 

72.1% 

56.8% 

72.2% 

69.4% 

69.0% 

55.3% 

78.7% 

68.8% 

77.6% 

76.8% 

67.7% 

76.0% 
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TABLE 7 

Maximum Likelihood Estimates 
Using n = 50 and p = .7 



Estimator 
Number 

Least Squares 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 



Mean 
1.07564 
1.59831 
1.53663 
1.49710 
1.54091 
1.50480 
1.51384 
1.56004 
1.61502 
1.86029 
1.51886 
1.52707 
1.52117 
1.52254 
1.53406 
1.53381 



Standard Error 
0.18404 
0.35701 
0.26290 
0.32149 
0.27622 
0.37540 
0.29097 
0.20159 
0.28937 
0.69816 
0.18349 
0.27130 
0.17718 
0.19391 
0.27147 
0.24538 



Percentage of Times 
Better Than Least Squares 



77.1% 

81.8% 

79.8% 

82.6% 

75.0% 

84.6% 

85.0% 

78.4% 

61.6% 

87.8% 

86.3% 

87.8% 

86.9% 

86.6% 

88.1% 
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TABLE 8 

Maximum Likelihood Estimates 
Using n = 100 and P = .7 



Estimator 
Number 

Least Squares 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 



Mean 
1.05882 
1.54251 
1.51260 
1.49829 
1.51980 
1.49121 
1.50696 
1.52788 
1.59659 
1.74638 
1.50877 
1.51632 
1.50986 
1.51072 
1.52053 
1.51936 



Standard Error 
0.11967 
0.21049 
0.17097 
0.22088 
0.17851 
0.26691 
0.19677 
0.12628 
0.18871 
0.35558 
0.12275 
0.17821 
0.11724 
0.12821 
0.17641 
0.16030 



Percentage of Times 
Better Than Least Squares 



89.4% 
93.5% 
92.2% 
93.0% 
89.3% 
93.8% 
95.8% 
88.7% 
69.8% 
97.0% 
95.8% 
97.1% 
96.9% 
96.6% 
96.5% 
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TABLE 9 

Maximum Likelihood Estimates 
Using n = 25 and p = .9 



Estimator 
Number 

Least Squares 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 



Mean 
1.40986 
1.59323 
1.56651 
1.49154 
1.55339 
1.50040 
1.53634 
1.57857 
1.55575 
1.63639 
1.53345 
1.50881 
1.54617 
1.54838 
1.52436 
1.54374 



Standard Error 
0.20824 
0.25242 
0.23136 
0.22689 
0.21167 
0.45821 
0.20634 
0.20509 
0.20939 
0.29926 
0.18688 
0.21678 
0.18711 
0.19755 
0.22482 
0.20053 



Percentage of Times 
Better Than Least Squares 



52.4% 

54.6% 

52.3% 

56.3% 

28.7% 

59.3% 

53.8% 

57.1% 

48.6% 

60.5% 

55.6% 

58.0% 

57.4% 

57.1% 

60.1% 
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TABLE 10 



Maximum Likelihood Estimates 
Using n = 50 and P = .9 



Estimator 
Number 

Least Squares 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 



Mean 
1.38100 
1.54664 
1.53445 
1.49482 
1.52760 
1.49769 
1.51866 
1.54020 
1.53799 
1.59003 
1.51740 
1.50245 
1.52409 
1.52523 
1.50932 
1.52176 



Standard Error 
0.13626 
0.16308 
0.15140 
0.15243 
0.14528 
0.31291 
0.14212 
0.13428 
0.14339 
0.19261 
0.12884 
0.13856 
0.12758 
0.13565 
0.13476 
0.13467 



Percentage of Times 
Better Than Least Squares 



62.3% 
64.2% 
62.3% 
64.7% 
33.9% 
66.4% 
64.0% 
63.7% 
55.5% 
67.6% 
64.7% 
66.3% 
65.5% 
65.2% 
67.0% 
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TABLE 11 

Maximum Likelihood Estimates 
Using n = 100 and p - .9 



Estimator 
Number 

Least Squares 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 



Mean 
1.36317 
1.52139 
1.51463 
1.49743 
1.51322 
1.48945 
1.50909 
1.51856 
1.52881 
1.56321 
1.50804 
1.50233 
1.51116 
1.51189 
1.50674 
1.51119 



Standard Error 
0.08547 
0.10001 
0.09500 
0.10769 
0.09513 
0.22639 
0.09564 
0.08238 
0.09681 
0.12001 
0.08598 
0.09719 
0.08287 
0.08852 
0.0938S 
0.09009 



Percentage of Times 
Better Than Least Squares 



74.1% 

74.8% 

75.4% 

76.6% 

46.4% 

76.6% 

75.9% 

72.8% 

63.5% 

78.1% 

77.1% 

78.3% 

77.5% 

78.0% 

77.6% 
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